A suprasegmental component in a speech recognition system
نویسندگان
چکیده
منابع مشابه
Suprasegmental duration modelling with elastic constraints in automatic speech recognition
In this paper a method of integrating a model of suprasegmental duration with a HMM-based recogniser at the post-processing level is presented. The N-Best utterance output is rescored using a suitable linear combination of acoustic log-likelihood (provided by a set of tied-state triphone HMMs) and duration log-likelihood (provided by a set of durational models). The durational model used in the...
متن کاملA compact low-cost speech recognition system
A compact low-cost speech recognition system for isolated words is presented. lt yields a recognition rate of more than 95 % in a speaker dependent mode for telephone-quality speech. With little hardware expense a responsetime of less than half a second for a maximum vocabulary of 60 words is obtained. Real-time processing of the speech signal isachieved by restriction to the sign correlation f...
متن کاملA system for audio-visual speech recognition
In this work, a system of audio visual speech recognition will be presented. A new hybrid visual feature combination, which is suitable for audio -visual speech recognition was implemented. The features comprise both the shape and the appearance of lips, the dimensional reduction is applied using discrete cosine transform (DCT). A large visual speech database of the German language has been ass...
متن کاملA Microphone Array System for Speech Recognition
In the past year, we have studied the microphone-array placement problem and come up with some optimal placements for a linear microphone array. In so doing we have developed a new method for general nonlinear optimization which we call the Stochastic Region Contraction method. This allowed us to get optimal solutions to our problem -globally optimal -in far less time than simulated annealing w...
متن کاملA Speech Recognition System for Urdu Language
This paper investigates use of a machine learnt model for recognition of individually words spoken in Urdu language. Speech samples from many different speakers were utilized for modeling. Original time-domain samples are normalized and pre-processed by applying discrete Fourier transformation for speech feature extraction. In frequency domain, high degree of correlation was found for the same ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Journal of the Acoustical Society of America
سال: 1982
ISSN: 0001-4966
DOI: 10.1121/1.2019819